Overview

Dataset Statistics

Number of Variables 17
Number of Rows 17379
Missing Cells 3568
Missing Cells (%) 1.2%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 4.1 MB
Average Row Size in Memory 246.7 B
Variable Types
  • Numerical: 10
  • Categorical: 7
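The missing-cell percentage above can be cross-checked against the other figures in this report. The sketch below is only an arithmetic check, assuming the percentage is computed over all cells (rows × variables). The per-column counts are the ones reported for temp and atemp further down.

```python
# Figures copied from the Dataset Statistics block of this report.
n_rows = 17379
n_vars = 17
missing_cells = 3568

# Assumption: the percentage is taken over every cell in the table.
total_cells = n_rows * n_vars
missing_pct = 100 * missing_cells / total_cells

# Per-column missing counts reported for temp and atemp.
per_column_missing = {"temp": 1784, "atemp": 1784}
column_pct = 100 * per_column_missing["temp"] / n_rows

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"temp missing: {column_pct:.2f}%")
print("columns account for all missing cells:",
      sum(per_column_missing.values()) == missing_cells)
```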

Dataset Insights

instant is uniformly distributed [Uniform]
temp has 1784 (10.27%) missing values [Missing]
atemp has 1784 (10.27%) missing values [Missing]
casual is skewed [Skewed]
registered is skewed [Skewed]
cnt is skewed [Skewed]
dteday has a high cardinality: 731 distinct values [High Cardinality]
dteday has constant length 10 [Constant Length]
season has constant length 1 [Constant Length]
yr has constant length 1 [Constant Length]
holiday has constant length 1 [Constant Length]
weekday has constant length 1 [Constant Length]
weathersit has constant length 1 [Constant Length]
windspeed has 2180 (12.54%) zeros [Zeros]
casual has 1581 (9.1%) zeros [Zeros]

Variables


instant

numerical

Approximate Distinct Count 17379
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 278064
Mean 8690
Minimum 1
Maximum 17379
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • instant is uniformly distributed

Quantile Statistics

Minimum 1
5-th Percentile 869.9
Q1 4345.5
Median 8690
Q3 13034.5
95-th Percentile 16510.1
Maximum 17379
Range 17378
IQR 8689

Descriptive Statistics

Mean 8690
Standard Deviation 5017.0295
Variance 2.5171e+07
Sum 1.5102e+08
Skewness 0
Kurtosis -1.2
Coefficient of Variation 0.5773
  • instant is not normally distributed (p-value 0.0)
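Since instant appears to be just the row index 1..17379 (its minimum, maximum and distinct count all agree with that), the figures in this block can be reproduced in closed form. This is a minimal sketch, assuming the report uses the sample standard deviation (ddof = 1) and linear-interpolation percentiles. The report states neither convention, but both reproduce the printed values.

```python
import math

n = 17379  # number of rows; instant is assumed to run 1..n

mean = (n + 1) / 2
total = n * (n + 1) // 2

# Sample variance (ddof = 1) of the integers 1..n is n(n + 1) / 12.
sample_variance = n * (n + 1) / 12
sample_std = math.sqrt(sample_variance)


def percentile(p):
    """Linear-interpolation percentile of the sequence 1..n, with p in [0, 1]."""
    return 1 + p * (n - 1)


q1, q3 = percentile(0.25), percentile(0.75)
iqr = q3 - q1
cv = sample_std / mean  # coefficient of variation

print(f"mean={mean} sum={total} std={sample_std:.4f} cv={cv:.4f}")
print(f"p5={percentile(0.05):.1f} q1={q1} q3={q3} "
      f"p95={percentile(0.95):.1f} iqr={iqr}")
```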

dteday

categorical

Approximate Distinct Count 731
Approximate Unique (%) 4.2%
Missing 0
Missing (%) 0.0%
Memory Size 1303425

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 2011-01-01
2nd row 2011-01-01
3rd row 2011-01-01
4th row 2011-01-01
5th row 2011-01-01

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 34758
Decimal Number 139032
  • dteday has words of constant length

season

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1147014

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 17379
  • The top 2 categories (3, 2) take over 50.0%
  • season has words of constant length

yr

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1147014

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 17379
  • The top 2 categories (1, 0) take over 50.0%
  • yr has words of constant length

mnth

numerical

Approximate Distinct Count 12
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 278064
Mean 6.5378
Minimum 1
Maximum 12
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • mnth is skewed left (γ1 = -0.0093)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 4
Median 7
Q3 10
95-th Percentile 12
Maximum 12
Range 11
IQR 6

Descriptive Statistics

Mean 6.5378
Standard Deviation 3.4388
Variance 11.8252
Sum 113620
Skewness -0.009252
Kurtosis -1.2019
Coefficient of Variation 0.526
  • mnth is not normally distributed (p-value 0.003478795940767677)

hr

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 278064
Mean 11.5468
Minimum 0
Maximum 23
Zeros 726
Zeros (%) 4.2%
Negatives 0
Negatives (%) 0.0%
  • hr is skewed left (γ1 = -0.0107)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 5
Median 12
Q3 18
95-th Percentile 22
Maximum 23
Range 23
IQR 13

Descriptive Statistics

Mean 11.5468
Standard Deviation 6.9144
Variance 47.809
Sum 200671
Skewness -0.01068
Kurtosis -1.198
Coefficient of Variation 0.5988
  • hr is not normally distributed (p-value 1.6721506816747017e-198)

holiday

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1147014
  • The largest value (0) is over 33.76 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 17379
  • The top 2 categories (0, 1) take over 50.0%
  • holiday has words of constant length

weekday

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1147014

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 6
2nd row 6
3rd row 6
4th row 6
5th row 6

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 17379
  • weekday has words of constant length

workingday

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1176258
  • The largest value (Yes) is over 2.15 times larger than the second largest value (No)

Length

Mean 2.6827
Standard Deviation 0.4654
Median 3
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 46623
Lowercase Letter 29244
Space Separator 0
Uppercase Letter 17379
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Yes, No) take over 50.0%
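The length and letter statistics for workingday are enough to recover the class counts behind the 2.15 ratio. The sketch below assumes the column contains only the two values "Yes" and "No", as the distinct count of 2 and the insight lines suggest. The derived counts are inferred from the report, not read from the data.

```python
n_rows = 17379          # rows in the dataset
total_letters = 46623   # "Count" under Letter for workingday

# Every value is assumed to be "Yes" (3 letters) or "No" (2 letters):
#   3 * yes + 2 * no = total_letters
#       yes +     no = n_rows
yes = total_letters - 2 * n_rows
no = n_rows - yes

mean_length = total_letters / n_rows
ratio = yes / no

# Each value has one uppercase letter; "Yes" has 2 lowercase, "No" has 1.
lowercase = 2 * yes + no

print(f"Yes={yes} No={no} mean length={mean_length:.4f} ratio={ratio:.2f}")
print(f"lowercase letters={lowercase}")
```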

weathersit

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1147014
  • The largest value (1) is over 2.51 times larger than the second largest value (2)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 17379
  • The top 2 categories (1, 2) take over 50.0%
  • weathersit has words of constant length

temp

numerical

Approximate Distinct Count 50
Approximate Unique (%) 0.3%
Missing 1784
Missing (%) 10.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 249520
Mean 0.4965
Minimum 0.02
Maximum 1
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • temp is skewed left (γ1 = -0.003)

Quantile Statistics

Minimum 0.02
5-th Percentile 0.2
Q1 0.34
Median 0.5
Q3 0.66
95-th Percentile 0.8
Maximum 1
Range 0.98
IQR 0.32

Descriptive Statistics

Mean 0.4965
Standard Deviation 0.1926
Variance 0.03709
Sum 7742.16
Skewness -0.002997
Kurtosis -0.9492
Coefficient of Variation 0.3879
  • temp is not normally distributed (p-value 3.1267749968701293e-10)

atemp

numerical

Approximate Distinct Count 65
Approximate Unique (%) 0.4%
Missing 1784
Missing (%) 10.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 249520
Mean 0.4753
Minimum 0
Maximum 1
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • atemp is skewed left (γ1 = -0.0864)

Quantile Statistics

Minimum 0
5-th Percentile 0.2121
Q1 0.3333
Median 0.4848
Q3 0.6212
95-th Percentile 0.7424
Maximum 1
Range 1
IQR 0.2879

Descriptive Statistics

Mean 0.4753
Standard Deviation 0.1719
Variance 0.02956
Sum 7412.2991
Skewness -0.08639
Kurtosis -0.8569
Coefficient of Variation 0.3617
  • atemp is not normally distributed (p-value 0.001157299941983817)

hum

numerical

Approximate Distinct Count 89
Approximate Unique (%) 0.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 278064
Mean 0.6272
Minimum 0
Maximum 1
Zeros 22
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • hum is skewed left (γ1 = -0.1113)

Quantile Statistics

Minimum 0
5-th Percentile 0.31
Q1 0.48
Median 0.63
Q3 0.78
95-th Percentile 0.93
Maximum 1
Range 1
IQR 0.3

Descriptive Statistics

Mean 0.6272
Standard Deviation 0.1929
Variance 0.03722
Sum 10900.61
Skewness -0.1113
Kurtosis -0.8262
Coefficient of Variation 0.3076
  • hum has 22 outliers

windspeed

numerical

Approximate Distinct Count 30
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 278064
Mean 0.1901
Minimum 0
Maximum 0.8507
Zeros 2180
Zeros (%) 12.5%
Negatives 0
Negatives (%) 0.0%
  • windspeed is skewed right (γ1 = 0.5749)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.1045
Median 0.194
Q3 0.2537
95-th Percentile 0.4179
Maximum 0.8507
Range 0.8507
IQR 0.1492

Descriptive Statistics

Mean 0.1901
Standard Deviation 0.1223
Variance 0.01497
Sum 3303.7063
Skewness 0.5749
Kurtosis 0.5903
Coefficient of Variation 0.6436
  • windspeed is not normally distributed (p-value 5.748529825531974e-05)
  • windspeed has 342 outliers

casual

numerical

Approximate Distinct Count 322
Approximate Unique (%) 1.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 278064
Mean 35.6762
Minimum 0
Maximum 367
Zeros 1581
Zeros (%) 9.1%
Negatives 0
Negatives (%) 0.0%
  • casual is skewed right (γ1 = 2.499)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 4
Median 17
Q3 48
95-th Percentile 138.1
Maximum 367
Range 367
IQR 44

Descriptive Statistics

Mean 35.6762
Standard Deviation 49.305
Variance 2430.986
Sum 620017
Skewness 2.499
Kurtosis 7.5685
Coefficient of Variation 1.382
  • casual is not normally distributed (p-value 2.958127590121308e-20)
  • casual has 1192 outliers

registered

numerical

Approximate Distinct Count 776
Approximate Unique (%) 4.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 278064
Mean 153.7869
Minimum 0
Maximum 886
Zeros 24
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • registered is skewed right (γ1 = 1.5578)

Quantile Statistics

Minimum 0
5-th Percentile 4
Q1 34
Median 115
Q3 220
95-th Percentile 465
Maximum 886
Range 886
IQR 186

Descriptive Statistics

Mean 153.7869
Standard Deviation 151.3573
Variance 22909.028
Sum 2.6727e+06
Skewness 1.5578
Kurtosis 2.7489
Coefficient of Variation 0.9842
  • registered is not normally distributed (p-value 4.231380656204056e-13)
  • registered has 680 outliers

cnt

numerical

Approximate Distinct Count 869
Approximate Unique (%) 5.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 278064
Mean 189.4631
Minimum 1
Maximum 977
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • cnt is skewed right (γ1 = 1.2773)

Quantile Statistics

Minimum 1
5-th Percentile 5
Q1 40
Median 142
Q3 281
95-th Percentile 563.1
Maximum 977
Range 976
IQR 241

Descriptive Statistics

Mean 189.4631
Standard Deviation 181.3876
Variance 32901.4611
Sum 3.2927e+06
Skewness 1.2773
Kurtosis 1.4165
Coefficient of Variation 0.9574
  • cnt is not normally distributed (p-value 1.7647639075415398e-14)
  • cnt has 505 outliers
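The report does not say how the outlier counts for the numerical variables are obtained. A common convention is the box-plot rule, which flags values more than 1.5 × IQR outside the quartiles. The sketch below computes those fences from the quartiles printed in this report. It is an assumption about the method, not a confirmed description of it. One consistency hint: the lower fence for hum comes out at 0.03, and the report lists both 22 zeros and 22 outliers for that variable.

```python
# (Q1, Q3) pairs copied from the Quantile Statistics blocks of this report.
quartiles = {
    "hum": (0.48, 0.78),
    "windspeed": (0.1045, 0.2537),
    "casual": (4, 48),
    "registered": (34, 220),
    "cnt": (40, 281),
}


def iqr_fences(q1, q3, k=1.5):
    """Return the (lower, upper) box-plot fences for the given quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


fences = {name: iqr_fences(q1, q3) for name, (q1, q3) in quartiles.items()}

for name, (low, high) in fences.items():
    print(f"{name}: flag values below {low:.4f} or above {high:.4f}")
```

Counting the rows that fall outside each fence in the raw data would confirm or refute the assumption.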

Interactions

Correlations

Missing Values